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Abstract. One autapomorphic character restricted to all 
Metazoa including Porifera |sponges] is the existence of 
transmembrane receptor tyrosine kinases (RTKs). In this 
study we screened for molecules from one subfamily within 
the superfamily of the insulin receptors. The subfamily 
includes the insulin receptors (InsR), the insulin-like growth 
factor I receptors, and the InsR-rclated receptors—all found 
in vertebrates—as well as the InsR-homolog from Drosoph¬ 
ila melanogaster . cDNAs encoding putative InsRs were 
isolated from the hexactinellid sponge Aphroeaflistes vas¬ 
tus, the demosponge Subcrites domuncula, and the calcar¬ 
eous sponge Sycon raphanus. Phylogenetic analyses of the 
catalytic domains of the putative RTKs showed that the 
sponge polypeptides must be grouped with the InsRs. The 
relationships revealed that all sponge sequences fall into one 
branch of this group, whereas related sequences from mam¬ 
mals (human, mouse, and rat), insects and molluscs, and 
polypeptides from one cephalochordate, fall together into a 
second branch. We have concluded that (/) the InsR-like 
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molecules evolved in sponges prior to the “Cambrian Ex¬ 
plosion" and contributed to the rapid appearance of the 
higher metazoan phyla; (//) the sponges constitute a mono- 
phyletic taxon, and (Hi) epidermal growth factor (EGF)-like 
domains are present in sponges, which allows the insertion 
of this domain into potential receptor and matrix molecules. 

Introduction 

The Porifera [sponges] are the oldest metazoan phylum; 
they existed 40 to 50 million years prior to the onset of the 
“Cambrian Explosion" (Valentine ct ah, 1996), the time of 
main divergence of metazoan phyla (Valentine, 1994). 
Highly conserved amino acid (aa) sequences in sponges 
indicate that the Porifera share one common ancestor with 
other metazoan phyla (Muller et ah, 1994; also see Muller, 
1995. 1997. and 1998). These sequences include those (/) 
for transmembrane receptors, e.g., transmembrane tyrosine 
kinase [TK] receptors |RTKs] (Muller and Schiicke, 1996); 
(/7) for transmembrane adhesion molecules, e.g., the inte¬ 
grals (Pancer et ah, 1997a); and (Hi) for G-protein linked 
transmembrane receptors for signaling molecules, e.g., the 
metabotropic glutamate receptor (Perovic et ah, 1999). Ad¬ 
ditional sequences from homeodomain transcription factors 
show that the transcriptional control of gene expression in the 
oldest Metazoa is similar to that of the most recent phyla 
(Seimiya et ah, 1994; Richelle-Maurer et ah, 1998; Coutinho 
et ah, 1998). One metazoan autapomorphic character restricted 
to Porifera is the presence of high telomerase activity in all (or 
almost all) cells, including somatic cells (Koziol et ah, 1998). 

The discovery that sponges contain transmembranc 
(Schacke et ah, 1994a), cytoplasmic (Ottilie et ah, 1992), 
and nuclear TKs (Cetkovic et ah, 1998) suggests that the 
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signaling system in these animals is sophisticated enough to 
respond to peptide growth factors and to cell adhesion 
(Muller and Muller, 1999). The catalytic domain of the 
RTKs is related to that of the cytoplasmic protein tyrosine 
kinases [PTKs] and the Ser/Thr kinases (Hanks and Hunter, 
1995; Kruse et al, 1997). The catalytic domain of the TKs 
is subdivided into 12 smaller subdomains, the first eight of 
which are most highly conserved (Hardie and Hanks, 1995). 
In addition to the characteristic tyrosine protein kinase- 
specific active-site signature, the previously described cat¬ 
alytic domain of the RTK from the demosponge Geodia 
cydonium contains no further site that marks this molecule 
as belonging to a specific class of RTKs (Schacke et al , 
1994a). In this study, we have demonstrated for the first 
time that one distinct subfamily of the RTKs is already 
present in all three classes of Porifera, and that it contains 
the TK class II signature with the consensus pattern 
D-[L1 V]-Y-x 3 -Y-Y-R (PC/GENE, 1995 [Prosite]). By 
choosing appropriate primers for the polymerase chain re¬ 
action, sequences were obtained from sponges that must be 
grouped with the insulin receptors [InsRs] of vertebrates 
(Ullrich et al, 1985), the insulin-like growth factor 1 recep¬ 
tors [1GF-1-Rs] of vertebrates (Ullrich et al, 1986), the 
InsR-related receptors of vertebrates (Shier and Watt. 
1989), and the InsR homolog from Drosophila melano- 
gaster (Fernandez et al, 1995). These molecules are all 
members of the class 11 RTKs which display, within sub- 
domain Vll, the following consensus for InsRs, GF-l-Rs, 
and InsR-homologs R-D-[1V]-Y-E-[TS]-D-Y (Hardie and 
Hanks, J995). 

Here we present the TK domains of lnsR-(like) molecules 
that have been isolated from the hexactinellid sponge 
Aphrocallistes vastus, the demosponge Suberites domun¬ 
cula, and the calcareous sponge Sycon raphanus . From S. 
raphanus, three full-length clones from the InsR-like mol¬ 
ecules are given. All of the sequences were used for phy¬ 
logenetic analyses. These revealed that the sponge InsR-like 
molecules are statistically significantly distinct from the 
related molecules of higher Metazoa, and allowed an as¬ 
sessment of the evolutionary order in which the three 
classes of Porifera appeared. 

Materials and Methods 

Materials 

Restriction endonucleases and other enzymes for recom¬ 
binant DNA techniques and vectors were obtained from 
Stratagene (La Jolla, CA; USA), Q1AGEN (Hilden; Ger¬ 
many), Boehringer Mannheim (Mannheim; Germany), 
GibcoBRL (Grand Island, NY; USA), Amersham (Buck¬ 
inghamshire; UK), USB (Cleveland, OH; USA), DUPONT 
(Bad Homburg; Germany), Epicentre Technologies (Madi¬ 
son, WI; USA), and Promega (Madison, Wl; USA). Taq 
DNA polymerase, DIG [digoxigenin] DNA labeling kit. 


DIG-11-dUTP, anti-DIG AP Fab fragments, and CDP 
[disodium 2-chloro-5-(4-methoxyspiro{ 1,2-dioxetane-3,2'- 
(5'-chloro)-tricyclo[3.3.1.1 37 ]decan}-4-yl)phenyl phosphate] 
were from Boehringer Mannheim (Mannheim; Germany). 

Sponges 

Live specimens of Sycon raphanus [Schmidt] (Porifera, 
Calcarea, Calcaronea, Leucosoleniida, Sycettidae) and 
Suberites domuncula [Olivi] (Porifera, Demospongiae, Tet- 
ractinomorpha, Hadromerida, Suberitidae) were collected 
from the Adriatic Sea near Rovinj (Croatia). The specimens 
of Aphrocallistes vastus [Schulze] (Porifera, Hexactinellida, 
Hexasterophora, Hexactinosida, Aphrocallistidae) w ; ere col¬ 
lected from Saanich Inlet and Barkley Sound, British Co¬ 
lumbia (Canada) by scuba diving. They were a gift of Dr. 
Sally P. Leys (Department of Biology, University of Vic¬ 
toria, P.O. Box 1700, Victoria, BC, Canada). The material 
was immediately frozen in liquid nitrogen until use. 

Construction of cDNA library from A. vastus 

Total RNA was extracted from sponge tissue, and poly- 
adenylated mRNA was isolated from total RNA as already 
described (Pfeifer et al, 1993a and b). cDNA was prepared 
with a ZAP Express cDNA synthesis kit. The cDNA library 
of A. vastus was prepared in Hybri ZAP11 (Stratagene) and 
packaged in vitro with the MaxPlax Packaging Extract 
(Epicentre Technologies). The library contained approxi¬ 
mately 2.4 X 10 6 independent plaque forming units (pfu); 
the amplified library was stored at 4°C. 

Screening and isolation of the cDNAs encoding InsR-like 
molecules 

The complete cDNAs as well as those encoding the 
catalytic domains were cloned by the polymerase chain 
reaction (PCR) from the A. vastus cDNA library (see 
above), the S . domuncula (Kruse et al, 1997), or the S, 
raphanus cDNA libraries (Kruse et al, 1997). The degenerate 
sense primer 5'-TTYGGlATGGTITAYGARGG-3' (Y = py¬ 
rimidine, R — purine, 1 = inosine) and the downstream primer 
(anti sense) 5'-TARTARTClGTYTCRTADATRTC-3' were 
designed against the conserved regions of TK subdomain 1 
(FGMVYEG) and TK subdomain Vll (D1YETDY) of InsRs 
as well as 1GF-1-Rs from mammalian species; these regions 
are different from the corresponding protein kinases of other 
classes (Scavo et al, 1991). The two primers define a 
470-490 bp long sequence encoding pail of the TK cata¬ 
lytic domain (Fig. 1). The PCR was carried out using a 
Gene Amp 9600 thermal cycler (Perkin Elmer), with an 
initial denaturation at 95°C for 3 min, then 35 amplification 
cycles each at 95°C for 30 s, 50°C for 45 s, 72°C for 1.5 
min, and a final extension step at 74°C for 10 min. The 
reaction mixture of 50 /xl included 20 pmol of the respective 
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degenerate primer and 10 pmol of the primer T7 (Strat- 
agene), 200 fxM of each nucleotide, 1 /xl of the respective 
cDNA libraries, buffer, and 2.5 units of Taq DNA polymer¬ 
ase. The expected amplified products were purified and 
concentrated using Geneclean Spin Kit and directly ligated 
into pGEM-T vector. After isolation and purification, the 
plasmid DNAs were sequenced with an automatic DNA 
sequenator [Li-Cor 4200]. 

The TK catalytic domains of the three S. raphanus InsR- 
like molecules were used and completed by both 5'- and 
3'-RACE, using the kits “5'-" and kk 3'-RACE System" to 
full-length cDNAs. 

Sequence analyses 

Sequences were analyzed using PC/GENE, release 14.0, 
from IntelliGenetics, Mountain View, CA (USA). Similar¬ 
ity searches and sequence retrieval were performed via the 
e-mail servers at the European Bioinformatics Institute, 
Hinxton Hall, UK (BLITZ and FASTA), and the National 
Institutes of Health, Bethesda, MD, USA (BLAST). The 
phylogenetic tree was constructed from an aa alignment by 
the neighbor-joining method (Saitou and Nei, 1987) apply¬ 
ing the PHYLIP package version 3.5c program (Felsenstein, 
1993). The degree of support for internal branches was 
further assessed by bootstrapping. The distance matrix was 
calculated as described (Dayhoff et al, 1978). Multiple 
alignments were performed with CLUSTAL W version 1.6 
(Thompson et aL, 1994) and their graphic presentations by 
the program GeneDoc (Nicholas and Nicholas, 1997). 

Northern blot 

RNA was extracted from liquid-nitrogen-pulverized 
sponge tissue with TRl/ol Reagent (GibcoBRL) as recom¬ 
mended by the manufacturer. Total RNA (1 /xg) was elec- 
trophoresed through formaldehyde/agarose gel and blotted 
onto Hybond N 1 membrane following the manufacturer’s 
instructions (Amersham). Hybridization experiments were 
performed with the probes SRINRJ, SRINR2, or SRINRJ 
[^600 bp segments] from S. raphanus. These probes were 
labeled with DIG-1 I-dUTP by the DIG DNA labeling kit. 
Hybridization was performed with the anti sense D1G- 
labeled probes at 42°C overnight using 50% formamide 
containing 5XSSC, 2% blocking reagent (Boehringer], 7% 
(w/v| SDS, and 0.1% [w/v] N-lauroylsareosine, following 
the instructions of the manufacturer (Boehringer]. After 
washing, DIG-labeled nucleic acid was detected with anti- 
DIG Fab fragments [conjugated to alkaline phosphatase] 
and visualized by a chemiluminescence technique using 
CDP, the chemiluminescence substrate for alkaline phos¬ 
phatase, according to the instructions of the manufacturer 
[Boehringer]. 


Results 

Cloning and sequencing the cDNAs encoding the 
InsR-like molecules 

The S. domuncula nt sequence, SDINR, is 491 nt long and 
has a potential open reading frame [ORF] of 489 bases 
encoding a deduced protein sequence of 163 aa residues. 
The sequence from A . vastus, AVINR, is 490 nt long with an 
ORF of 489 nt (163 aa). 

Three putative sequences of InsR-like molecules were 
isolated from the cDNA library of S. raphanus. The cDNA 
for type 1 InsR, SRINRJ, is 2026 nt long with an ORF 7 of 
1848 nt encoding a putative sequence of 616 aa (Figs. 1 and 
2); type 2 InsR, SRINR2, is 2150 nt long with an ORF of 
1842 nt (614 aa); and type 3 InsR (Fig. 2), SRINRJ, is 1433 
nt long with an ORF of 1368 nt (456 aa) (Fig. 2). Northern 
blot analyses were performed with these S . raphanus cDNA 
probes. One band each of approximately 2.2 kb (type I), 2.3 
kb (type 2), and 1.6 kb (type 3) were obtained, confirming 
that the full-length cDNAs were isolated (Fig. 3). 

Deduced aa sequences of the catalytic domains of the 
putative sponge InsRs 

The deduced aa sequences of the catalytic domains of the 
InsR-like sequences between subdomains I to VII have been 
aligned (Fig. 1). The borders of subdomains 1 to VII (ac¬ 
cording to Hardie and Hanks, 1995), could be defined for all 
sponge sequences unequivocally (Fig. I). Specific sites and 
sequence characteristics were also present as outlined ear¬ 
lier (Muller and Schacke, 1996): in subdomain 1, the ATP- 
binding site [consensus: GxGxxGxV; but in the hexactinel- 
lid INR„AV sequence G is replaced by R]; within 
subdomain II. the residue Lys in the consensus VAxK, 
which is required for kinase activity; within subdomain Vlb: 
the aa 13 [Asp] and N [Asn] as well as in subdomain VII: the 
DFG tripeptide is present. The DFG segment has been 
implicated in ATP binding (Hanks et al., 1988) and repre¬ 
sents the most conserved portion within the catalytic do¬ 
main. The tyrosine residue (Y) in subdomain VII (aa no. 
180 of the catalytic domain, with respect to the G. cydoninm 
RTK) undergoes phosphorylation and is the tyrosine kinase 
phosphorylation site. Signatures within subdomains VI11. 
IX, X, and XI are generally less well conserved. Therefore, 
the PCR-based sequencing was restricted to the part within 
subdomains I to VII. The TK-specific active-site signature, 
D-L-A-T/A-R-N, characteristic for both vertebrate and in¬ 
vertebrate TKs (Ottilie et al., 1992; Hanks et al., 1988) is 
found in subdomain Vlb. Within the subdomain VII the 
signature for the TK class II receptors with the consensus 
pattern is found, D-[LlV]-Y-x r Y-Y-R (PC/GENE, 1995 
[Prosite]). 

The PCR primers were chosen to identify, in sponges, 
those catalytic domains of class II RTKs that share the 
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highest similarity to InsRs. 1GF-1-Rs, to InsR-related rccep 
tors, and to the insulin receptor homolog from D. melano- 
gaster (Fernandez et cil ., 1995). These receptors have the 
consensus within the class 11 signature ol R-D-[lV]-\ -E- 
[TSJ-D-Y (Hardic and Hanks, 1995). As seen in Figure 1, 
this consensus is, as expected, present in all sponge se¬ 
quences; therefore the sequences from the demosponge 5. 
domuncula , the calcareous sponge S. raphanus , and the 
hexactinellid sponge A. vastus were termed InsR-like mol¬ 
ecules. 

Complete an sequences of the InsR-like sequences 
from S. raphanus 

Three cDNAs encoding complete putative InsR-like se¬ 
quences from S . raphanus have been isolated from the 
library. The sequences are termed type 1, SRINRI , type 2, 
SRINR2 , and type 3, SRINR3, InsR-like molecules. The 
putative 616 aa sequence INR_SR1 (deduced from 
SR1NR1) has a calculated M r of 69,477; 1NR_SR2 of 614 aa 
has an M r of 69,213, and 1NR_SR3 of 456 aa has an M r of 
51,259. 

The sequence 1NR_SR2 was selected tor the analysis 
given here. The transmembrane segment, determined ac¬ 
cording to the program ^RAOARGOS" (PC/GENE, 1995) 
ranges from aa I81 to aa 196 . The intracellular domain is, as in 
other RTKs (Hardie and Hanks, 1995), divided into a jux- 
tamembrune domain (aa 197 to aa 242 ) and the catalytic do¬ 
main [TK domain] (aa 243 to aa 521 ) (Fig. 2). The catalytic 
domain is subdivided into 12 subdomains and contains the 
characteristic TK-specific active-site signature and the RTK 
class 11 signature (see above); in addition, the putative 
ATP-binding site (Hanks et al. , 1988) is present (Fig. 2). 

The extracellular domain contains one calcium-binding, 
epidermal growth factor receptor [EGF]-like domain that 
reads D-x-N-E-C 1 -D-x 5 -C“-D-E-C -Q-N-C 4 -x-N-x (1 -C -x- 
N-x 3 -C 6 -D; it is located from aa l29 to aa 162 [the Cys resi¬ 
dues are numbered consecutively]. This EGF-like domain 
consists of six Cys residues, flanked by aa with carbonyl 
oxygen atoms, which are arranged slightly differently from 
those found in molecules from higher Metazoa. In particu¬ 
lar, the Cys 4 and Cys 5 are separated by more than one aa 
(Bork et al. . 1996). Furthermore, an incomplete EGF-like 
domain is present from aa 49 to aa l28 . The two other types ol 
InsR-like molecules from S. raphanus also have two EGF- 
like domains, and they are similarly arranged. This finding 
is the first demonstration that EGF-like domains are present 
in the lowest metazoan phylum. Until now, this domain, 
which is widely found in vertebrate receptors— e.g., mam¬ 
malian epidermal growth factor receptors (Geer et al. , 1994) 
and matrix proteins like fibulin (Pan et ah, 1993)—has only 
been identified among invertebrates in Caenorhahditis el- 
egans (Campbell and Bork, 1993). 


Phylog enetic ana lyses 

When the deduced aa sequences of the TK catalytic 
domains from the three sponge species were analyzed using 
the programs BLITZ, FASTA, and BLAST, they displayed 
highest similarity to the polypeptides from both inverte¬ 
brates and vertebrates. Among invertebrates, these domains 
were most similar to the insulin-like receptors from the 
insects Aedes aegypti and D. melanogaster, as well as to the 
insulin receptor of the mollusc Aplysia californica . In addi¬ 
tion an InsR-homolog sequence isolated, so far. from one 
cephalochordate, Branchiostoma lanceolatum, as well as 
InsRs, 1GF-1-Rs, or InsR-homologs from selected verte¬ 
brates (human, mouse and rat) were highly similar to the 
sponge sequences. They share about 40%—45% ol identical 
aa and about 60%-65% of similar aa (including identical 
aa) with the selected corresponding molecules. Taking only 
the sponge sequences, the sequence from S. raphanus, type 
1, is identical in 67% of the aa (similarity of 78%) within the 
catalytic domain with A. vastus and in 69% (79%) with S. 
domuncula. The finding that the three sequences obtained 
from S. raphanus differ considerably from each other is 
interesting; type 1 shares only 75% identical aa (similarity 
86%) with type 2 and only 79% identical aa (similarity 
88%) with type 3. 

The phylogenetic tree was constructed and rooted with 
the sequence of the catalytic domain of the Fes/FER non¬ 
receptor TK domain from S. raphanus (Cetkovic et al ., 
1998; Fig. 4A). All sequences used were cut for the align¬ 
ment to obtain the 12 subdomains, comprising approxi¬ 
mately 300 aa. All of the sponge sequences fall into one 
branch of the tree, whereas the seleeted sequences of InsRs, 
IGF-l-Rs, or InsR-related sequences from invertebrates and 
vertebrates are grouped together into a second one. This 
relationship is statistically very robust as analyzed by boot¬ 
strapping. Hence, support for monophyly of Porifera can be 
deduced. In consequence, the presented findings, based on 
the data obtained with the catalytic domains of the InsR-like 
molecules from sponges, shed new light on the assumed 
uncertain position of sponges as reviewed by Rodrigo et al. 
(1994). In addition, the data given do not support earlier 
notions which suggested that the phylum Porifera might be 
paraphyletic (Cavalier-Smith et al., 1996). 

Rate of evolution of the catalytic domains of sponge 
InsR-like molecules 

Use of our data collected on the percentage of aa identity 
among the polypeptide sequences from the different sponge 
species on one side and the sponge sequences in comparison 
to those from higher metazoan allows a relative approach to 
determining the time of divergence of the sponge classes 
from a common ancestor. This estimation, which is based on 
the number of point mutations per 100 aa within given 
polypeptides, might reflect the time of divergence of two 
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Figure I. Alignment of the catalytic domains of the insulin-like receptors as well as of the related sequences 
within the class 11 tyrosine kinase receptors (RTKs), The deduced amino acid sequences of InsR homologs from 
the polypeptides of the three classes of the phylum Porifera (sponges) were aligned with the related sequences 
tor three invertebrates (mosquito, fruit fly, and mollusc); one ccphalochordate (amphioxus); and the three 
vertebrates (human, mouse, rat). In addition, the RTK domain from the sponge Geodia cydonium was used for 
the comparison. Specific sequence codes are identified below. The borders of the subdomains [1 to VllJ are 
shown in the figure below the sequences; the nomenclature follows that of llardie and Hanks (1995). Also 
marked are the tyrosine protein kinase specific active-site signature (Tyr-signature [--]), the TK class 11 
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taxa. The evolutionary rates—expressed as ^-values— 
vary between different proteins (Zuckerkandl and Pauling, 
1965; Kimura, 1983; Li et aL, 1987). In a previous study, 
the galectin protein from the sponge G. cydonium (Pfeifer et 
a /., 1993b) was calculated to have an estimated evolutionary 
rate of 0.97 X 1CT 9 aa substitutions/site/year (Hirabayashi 
and Kasai, 1993); a value of 1.24 X 10 9 was calculated for 
the RTK from G. cydonium (Schlicke et aL, 1994b) from the 
same animal. 

Dating based on the molecular clock is inaccurate be¬ 
cause its rate often varies. If we accept this insecurity, reject 
the estimated evolutionary rate from sponge genes, and 
accept the one calculated from the time of protostome- 
deuterostome divergence—700 MYA (Dayhoff 1978)—we 
can postulate the time of separation of the sponges from the 
common metazoan ancestor, as follows. It we take the 
calculated £ aa -value for the human to D . melanogaste (0.46) 
as a reference for the protostome-deuterostome split, then 
the hexactinellid sponge A. vastus branched oft 1400 MYA 
(£ aa -value of 0.92), followed by the demosponge 5. donum - 
cula 1300 (A aa -value of 0.84) and the calcareous sponge S. 
raphanus 1200 MYA for type 1 and 2 (A aa -value of 0.80) 
and for type 3 1100 MYA (A aa -value of 0.77). Recent fossil 
data show (Li et aL 1998) that sponges existed in much 
their present form 580 MYA (Fig. 4B). 

Discussion 

We have shown that all three classes of the phylum 
Porifera express molecules related to InsR; and these mol¬ 
ecules display, in their extracellular domains, EGF-like 
sequences (as shown here for 5. raphanus ). This finding 


implies that animals of the lowest metazoan phylum already 
contain growth factor receptors that allow them to react to 
nutrient cues and also to neighboring, individual cells, with 
a complex intracellular signaling reaction. The lnsR-ho- 
mologs, which are putative transmembrane receptors, pre¬ 
sumably allow the transduction of signals through the cel¬ 
lular membrane. Usually signaling by RTKs involves 
ligand-mediated receptor dimerization (Geer et aL, 1994), a 
process that has not yet been studied in Porifera. InsRs, 
1GF-1-Rs, and InsR-related receptors or InsR-homologs ot 
higher metazoan taxa do not contain, in their extracellular 
loops, EGF-like domains, but rather cysteine-rich regions 
(Geer et aL, 1994). This finding underlines again previous 
findings, that most polypeptides deduced from the cDNA 
sequences of sponges are assembled by an unusually large 
variety of modules. For one example, the putative sponge 
aggregation receptor is composed of scavenger receptor 
cysteine-rich domains as well as of short consensus repeats 
(Pancer et aL, 1997b; Blumbach et aL, 1998) in a structural 
complexity not known in higher Metazoa. 

From the evolutionary point of view, the present contri¬ 
bution makes three points. First, it establishes that mole¬ 
cules similar to the InsR-homologs have evolved prior to the 
“Cambrian Explosion." Suga et aL (1997) suggested that 
most of the PTK subfamilies, including InsRs, diverged by 
domain shufflings, together with gene duplications before 
the diploblast-triplobast split. As a result of recent findings 
that the Porifera already existed before this event (Li et aL, 
1998), we can assume that this class of key molecules, 
involved in the complex network of intracellular signaling, 
could have been one major driving force that allowed the 


signature [specific for InsRs and related sequences] (<—class It —* [InsRj) as well as the TK phosphorylation site 
(P). The positions of the primers are indicated ([ + + + + ]) above the sequences. Identical aa residues in all 15 
sequences are shown in white-on-black, and residues conserved in at least eight sequences are shaded. 


Vertebrates 

IG1R HUMAN 
1NSRJ1UMAN 
INSR_MOUSE 
1GF_RAT 
1NSR_RAT 
Cephalochordate 
ILPR.BRALA 
Invertebrates 
1NSR_AEDAE 
INR_DROME 
INSR_APLA 


Human insulin-like growth factor 1 receptor precursor (X04434) 

Human InsR precursor (P06213) 

InsR precursor house mouse (Mus musculiis: PI5208), 

IGF-I-Rl receptor precursor rat ( Rattits nonegicus: A33S37) 

InsR precursor from rat (/?. non'egicus; PI5127) 

Insulin-like peptide receptor precursor amphioxus ( Branchiostoma lanceolatum; 002466) 

Insulin-hke receptor precursor mosquito {Aedes aegypti; Q931U5) 

InsR homolog fruit fly ( Drosophila melanogaster; U28I36) 

InsR the mollusc ( Aplxsia californica; 1587845) 


Sponges 

INR_SD 

INR.SRl 

INR_SR2 

1NR_SR3 

1NR_AV 

RTK.GC 


Insulin-like receptor demosponge Suberites domuncula 
Insulin-like receptor calcareous sponge Sycon raphanus type I 
Insulin-like receptor calcareous sponge S. raphanus type 2 
Insulin-like receptor calcareous sponge S. raphanus type 3 
Insulin-like receptor hexactinellid sponge Aphrocallistes vastus 
Receptor tyrosine kinase demosponge Geodia cydonium (X77528) 
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Figure 2. Alignmeni of ihe deduced au sequences of the InsR-like sequences from Sycon raphcuius type 1 
{INR_SR 1), type 2 (INR_SR2). and type 3 (1NR_SR3). The four segments of the sequences are the extracellular 
domains (EC domain); the iransmembrane segments (TM); and the two intracellular domains, the juxtamem- 
brane domain (JM domain) and the catalytic domain (TK domain). The TK domain is subdivided into the 12 
subdomains (above the sequence) with the characteristic TK-specific active-site signature (Tyr-signature 
. )|U ] rjk c j ass i| signature (♦—class II—*) (below the sequence); in addition, the putative ATP- 
binding-site (ATP) is marked In the extracellular domain, the conserved Cys residues (arrowhead) of the two 
EGF-like domains (EGF-like) are indicated. Identical amino acid residues are shown in white-on-black type, and 
residues conserved in at least two sequences are shaded. 


K1JC 1UCTAISNS -LKii* IrL VHlMLfc IlUiwILLLVf TSLLAKoIKS IKPHKAl AS S YHHNDV 
DHTSLKLRGRTENQGHNVSTWLWTNANDSYYPLPSHV- 


b lb 

614 


other metazoan phyla to arise. Second, the phylogenetic 
analyses conhrni lhat, based on the autapomorphic character 
for Metazoa, the RTKs, sponges as a taxon are monophy- 


letic; the Hexactinellida have been calculated to be the 
oldest class, followed by the Demospongia and finally by 
the Calcarea. Third, EGF-like domains arc already present 
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SRINR 

type: 2 1 3 

kb 



Figure 3. Northern blot analyses to determine the sizes of the tran¬ 
scripts of the mRNA encoding the Sycon raphanus InsR-like molecules 
{SRINR) type 1 {SRINR J), type 2 (SRINR2), and type 3 (SR1NR3). RNA 
was prepared from sponge tissue and 1 /xg each was subjected to analysis. 
Molecular masses of marker RNAs, which were run in parallel, are given 
on the right (in kilobytes'). 
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PORIFERA 
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common metazoan ancestor 


Figure 4. (A) Rooted phylogenetic tree of the catalytic domains of the 

sequences listed in Figure 1. The numbers at the nodes refer to the levels 
of confidence as determined by bootstrap analysis. The scale bar indicates 
an evolutionary distance of 0.1 amino acid substitution per position in the 
sequence. The catalytic domain of the Fes/FER nonreceptor TK domain 
from Sycon raphanus (Fes/FER_SR, Y17051) was used as the outgroup 
sequence. (B) Proposed branching order of the three classes of the phylum 
Porifera (Hexactinellida, Demospongiae, and Calcarea) from a common 
metazoan ancestor. The dates of the approximate times of divergence are 
indicated. 


in sponges, where I hey were inserted into poieniial cell 
surface receplors and also into matrix molecules. 
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